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ABSTRACT 


The objective of this project is to analyse VLF/ELF 
or speech signals in real time for its various frequency 
components using digital signal processing techniques. 
Fourier transform is used as a useful tool in extracting 
information in speech signals. Analog methods have limitei 
resolution and flexibility. Disadvantages are overcome by 
taking Discrete Fourier Transform (DFT) FFT is an efficiei 
way of realizing DFT. 

The FFT is computed through the system TM 990/189 
microcomputer. It consists of TFiS 9980 CPU and TMS 9901 
PSI (Programmable System Interface). The analog signal of 
low frequency is converted into digital signal which is 
stored in memory of the system through PSI. The output 
points of FFT stored in memory, are converted into analog 
signal and displayed on the oscilloscope. Thus the spectrun 
of analog signal in VLF/ELF range can be analysed. 
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CHAPTER 1 

INTRODUCTION 

Analysis of signals in the frequency domain is an 
important measurement concept which is widely used for 
providing electrical and physical system performance 
informations# There are many instances when signal 
processing involves the measurement of spectra. For example, 
in speech recognition problems, spectrum analysis is usually 
a preliminary to further acousting processing. In speech 
bandwidth reduction systems, spectral analysis is generally 
the basic measurement. In sonar systems, sophisticated 
spectrum analysis is required for the location of surface 
vessels and submarines. In radar systems, obtaining target 
velocity information generally implies the measurement of 
spectra. Thus spectrum analysis encompasses a great variety 
of different measurements. Several types of instruments 
are used for frequency response signal analysis e,g, spoctruri 
analyser, digital Fourier analysers, wave analysers, distor- 
tion analysers and modulation analysers. 

Each of the instruments measures basic properties of a 
signal in the frequency domain, but each uses a different 
technique. The Fourier analyser uses digital sampling and 
transformation techniques to form a Fourier spectrum display 
that has amplitude as well as phase information. Our project 
also aims at analysing the spectrum of speech or VLF/ELF 
signals for real time applications. 
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In spite of the advantages of digital signal analysis, 
analog methods are used for spectrum analysis at higher 
frequencies. The reason for this widespread use of analog 
methods for making this class of measurements is the ease 
of construction of narrow band analog filters for reaso- 
nably high frequencies. To carry out the same operation 
digitally the input signal must first be sampled, and 
then each sample is converted to its digital equivalent. 

If the input signal is sampled at uniformly spaced 
instances, the sampling rate must be at least twice the 
highest frequency present in the analog input signal in 
order to avoid errors in the measurement. The high sampling 
rate required by this method makes the cost of the sample 
and hold and ADCs which must be used in the implementation 
of such methods prohibitive. These requirements make the 
spectrum analysis increasingly difficult at higher frequen- 
cies, Our project is the spectrum analysis of speech or VLF/ 
ELF signals. 

The algorithm used for the spectrum analysis is the 
Fast Fourier Transform (FFT). We have selected radix-2, 
in-place algorithm. The processer to compute FFT can be 
designed using multipliers; bit-slice microprocessors or 
ordinary microprocessors. The design of processor with 
multipliers involves more hardware. in the design of 
processor with bit-slice microprocessors, it is a difficult 
to write microprogram. Therefore we chose the ordinary 
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microprocessors. The 16-bit IMS 9980A microprocessor was 
selected for our project. 

To take the input samples, a data acquisition system 
is designed. This system samples input analog signal and 
converts them into digital form and are stored in the 
system memory. The microcomputer does the FFT and output 
points are displayed on the oscilloscopes. The hardware 
system for display is designed. The number of points and 
scans are displayed on the Y-axis and X— axis respectively 
of the oscilloscope. The magnitude of FFT points are sent 
to the Z-axis of oscilloscope. Thus the magnitude of 
spectrum can be seen in the form of intensity on the screen. 

The aim of the present work is to present a compact 
visual display of the spectral characteristics of signals 
in the VLF/ELF range. Such a display enables one to 
readily pick out features of the signal. An application 
is the measurement of magnetospheric electron densities 
using whistlers. Whistlcisare the result of the dispersion 
resulting from the propagation of lightering strokes or 
atmospherics through the magnetosphere. A spectrogram of 
a whistler is shown in Fig 1.1. The minimum propagation 
delays give the electron densities along various paths- 
while the corresponding frequencies give the paths 
themselves. 
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Fig 1,1 Spectrogram showing multipath whistlers 

recorded at eights station. The spectrogram 
shows several traces or components all of 
which originated from an atmospheric at 0 sec. 
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CHAPTER 2 

FAST FOURIER TRANSFORM 

FFT is an algorithm (i-e. a particular method of 
performing a series of computations) that can compute the 
discrete Fourier transform much more rapidly than other 
available algorithms. Simple calculations show that 
computing DFT of N complex points requires ii complex multi- 
plications. The FFT owes its success to the fact that the 
algorithm reduces the number of multiplications and additions 
required in the computation of DFT. 

An original algorithm developed by J.W. Cooley and 
J.W. Tukey reduces the computational load to N logg N 
(complex multiplications), where B is the base (the base 
may be 2, 4, 8 or 16), 

There have been some variations (like Sande-Tukey 
algorithm) to the original algorithm and at present there 
are many algorithms available for calculating the FFT. There 
have been many additional modifications depending on the 
particular requirements, the limitations of the available 
hardware, and the ingenuity of the individual designer or 
programmer. 

Most of these algorithms may be classified as either 
(a) in place or (b) natural input-output [9] 
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2.1 IN PLACE ALGORITHM 

An in-place algorithm is one in which a given component 
of any intermediate vector may be stored in the same location 
occupied by the corresponding component of the preceding 
vector. This type of algorithm requires less total storage, 
but at the same time the computational time is higher than 
that required for natural input-output algorithm. In this, 
either the output spectrum appears in an unnatural order or 
they require that the input data be arranged before entering 
the computation array. 

Thus in-place algorithm requires the reordering of input 
or output data. The process in which the input points are 
reordered from natural to unnatural form is called scrambling. 
On the other hand the process in which the output points are 
reordered from unnatural to natural form is called unscram- 
bling. In our system we need unscrarbling operation of 
output points. 

2.2 SIGNAL FLOW GRAPH 

FFT signal flow graph for in-place algorithm for N*8 
is shown in Fig 2.1. It consists of data array and compu- 
tational arrays. Data array is represented by a vertical 
column of nodes on the left of the graph. The vertical 
columns to the right of the data array correspond to compu- 
tational arrays and in general there will be r computational 
arrays where r * Log 2 W 
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2.3 BASIC PROPERTIES OF IN-PLACE ALGORITHM [l ] 

From the flow graph, the following basic properties 
can be identified : 

(i) Number of computational arrays 
r = Log 2 N 

(ii) Each array requires complex multiplications and 
and N complex additions. Hence the total number 
of multiplications and additions required are 
(N/2) Log 2 N and N Log 2 N respectively. The ratio 
of direct to FFT computation time is 

_ 2N 

N/2 Log2N ~ Log2N 

(iii) The computation of a dual node pair requires only 
one multiplication and two additions. If the 
weighting factor at one of the nodes in a dual node 

p 

pair is W , then the weighting factor at the other 
node of the pair is 

then 

x^{k + N/2 ) = (*<)'* ) 

*tih 

here x (k) indicates k component in the £ array. 



9 


(iv) The epacing between dual node pairs differ from 

array to array. In the array (jC = 1, 2 , ..r), 

the spacing is N/2 * i.e. x(k) and x{k+N/2 ) consti- 
tute a dual node pair. 

(v) To evaluate the value of P, the exponent of V/, for 
any index in a given array, the following procedure 

I'll 

is followed. Represent k, the node index in the J? 

array, in binary form with r bits, retain the most 

significant bits and add (r-£) leading zeros to 

form a r bit binary number. Reverse the bit order 

of the resulting number. The decimal equivalent of 

the final binary number gives the index P. The 

i#H "trH 

weighting factor for the k node of the £ array 
IS W . 

(vi) The output after r arrays is in scrambled form. 

To unscramble the output x{k), write the index k 
in binary form with r bits and reverse the bit order. 
The resulting decimal number is the index n of x(k). 
The unscrambling procedure is shown in Fig 2,2. 

2.4 NATURAL INPUT-OUTPUT ALGORITHM 

Natural input-output algorithm is one in which a given 
component of any intermediate vector may not be stored in 
the same location occupied by the corresponding component 
of the preceeding vector, thus requiring the extra memory 
for storing the intermediate result. An N point FFT 
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(N real and N imaginary) will require 4N words of memory 
as compared to 2N words required for in-place algorithm. 

These algorithms, of course, maintain the natural input- 
output order and thus do not require scrambling/unscrambling 
at the input/output levels, and as such are faster as 
compared to the in-place algorithm. In our system, in-place 
algorithm has been used for it requires less number of 
memory locations, 

2.5 DECIMATION-IN-TIME/FREQUENCY [?] 

The decimation-in-time FFT algorithms are all based 
upon the decomposition of the DFT computation by forming 
smaller and smaller subsequences of the input sequence 
x(k}. Alternatively we can consider dividing the output 
sequence, x(n), into smaller and smaller subsequences in 
the same manner. The class of FFT algorithms based on this 
procedure is commonly referred to as decimation in frequency, 

2.6 FFT ALGORITHMS FOR REAL DATA [l] 

In applying FFT, we often consider real function of time 
whereas the frequency functions are, in general complex. 

Thus a single computer program can be written to determine 
both the discrete transform and its inverse such that a 
complex time waveform is assumed. 

If the time function being considered is real, we must 
set to zero the imaginary part of the complex time function. 
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This approach is inefficient in that the computer program 
will still perform the multiplications for the zero imaginary 
parts. In following sections two techniques for using this 
imaginary part of the complex time function to more effi- 
ciently compute the FFT of real functions are described. 

2.7 FFT OF TWO REAL FUNCTIOWS SIMULTANEOUSLY [l] 

For efficient computer program, it is desired to compute 
the DFT of the real time functions h{k) and g(k) from the 
complex function 

y(k) = h{k) + jg(k) 

That is* y{k) is constructed to be the sum of two real 
functions where one of these real functions is taken to be 
imaginary. After computing DFT of y(k), the real and 
imaginary parts of the DFT are decomposed. Thus the simul- 
taneous discrete transform of two real time functions can 
be accomplished. 

2.8 TRANSFORM OF 2N SAMPLES WITH N SAMPLE TRANSFORM [l ] 

The imaginary part of the complex time function can 
also be used to compute more efficiently ;he DFT of a 
single real time function. Consider a function x(k) which 
is described by 2N samples. We wish to break the 2N point 
function x{k) into two N sample functions. We devide x(k) 
as follows ; 
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h(k) = x(2k) 

g(k) = x{2k+1 ) k = 0,1, 

yCk) = h(k) + jg(k) 

That is, function h(k) is equal to the even numbered samples 
of x{k), and g{k) is equal to the odd numbered samples. The 
computation procedure is given in Fig 2.3. 

A general FFT flowchar-^- for N complex input points 
based on the algorithm of section 2.1 is shown in Fig 5.2. 
The flowchart has been implemented for our system. Since 
the input points are real, imaginary parts are made zero 
at input stage. The algorithm described in section 2,B is 
more efficient for the imaginary parts of the complex time 
function can also be used to compute DFT of a real time 


function 
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1. Function x{k) is real k = 0,1, , . . . 2N-1 

2. Divide x(k) into two functions 

h{k) = x{2k) k * 0,1 , . .N-1 

g(k) = x(2k+1) 

3. Form the complex function 

y(k) = h{k) + jg(k) k = 0,1 N-1 

4. Compute 

Y(n) = y(k) 

k=0 


= R(n) + jl(n) n = 0,1, ...N-l 

where R(n) and I{n) are the real and imaginary parts 
of Y{n), respectively. 


5. Compute 

Xr{n) 


, j-R^ ^ ^ j-iiai ^ jMj 

. Sin ^ N-1 


Xi(n) = 


J(n) I(N-n)' 


fllnl , liN-nll 

12 ^ 2 J 

- Sin ^ ^ 

12 ^ 2 i 


Cos 


21 

N 




.N-1 


where Xr(n) and Xi(n) are real and imaginary parts of 
2N point of X ( k) . 


Fig 2.3: Computation procedure for DFT of a 2N point 
function by means of an N point transform. 
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CHAPTER 3 

SYSTEM DESIGM CONSIDERATIONS 

3.1 CONSIDERATIONS IN FFT HARDWARE IMPLEMENTATION [z] 

One needs to examine the following four factors : 

(a) The reasons for building special-purpose-hardware. 

(b) What are the various options available, so far as 
machine organization is concerned. 

{c) Trade-offs between cost, speed and accuracy, 

(d) Interface units etc. 

3.2 SYSTEM BLOCKS 

The block diagram of the system is shown in Fig 3,1. 

The system consists of three main blocks: (1) data aquisition 
system, (2) FFT processor and (3) display system. 

3.3 FFT PROCESSOR [?] 

The organization of an FFT processor is usually 
dictated by the performance and cost requirements and the 
technology assumed. An N-point FFT requires ~ log 2 N basic 
computations where one basic computation implies one complex 
multiplication, one complex addition and one complex subrac- 
tion. Depending upon the number of basic computations done 
in parallel, four families of machine organizations are 
commonly used. 

(a) SEQUENTIAL PROCESSOR: 

The main features of the sequential processor 
are ; 
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FIGt 3-1 SYSTEM GLOCK D{A(5^RAM 
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(i) One Arithmatic unit 

(ii) •log2N operations sequentially. 

(iii) Maximum sampling rate for real time processing 

~ 2 T^~ 1 ) t ^ ~ ^sec, time required by a basic 

computation . 

(iv) Execution time = ~»log 2 ^* 1 ' ^sec. 

(b) CASCADE PROCESSOR 

The sequential processor does not yield speeds 
generally required for real time processing of 
signal. Its performance can be improved by 
introducing a certain amount of parallelism. 

The main features of a cascade processor are; 

(i) 0092^1 arithmatic units for log2N opera- 
tions in parallel. 

(ii) operations sequentially. 

(iii) Maximum sampling rate allowable = MHz, 
t = psec. 

(c) THE parallel ITERATIVE PROCESSOR 

A second alternative for improving the perfor- 
mance of a sequential processor is to introduce 
parallelism within each iteration i.e. performing 
all the computations within an iteration in 
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parallel and log^N iterations sequentially. 

Such an organization is known as parallel 
iterative. The main features of this organi- 
zations are ; 

{i) ^ Arithmetic units for ^ operations in 

parallel . 

(ii) 1092^ operations sequentially. 

N 

(iii) Max, sampling rate allowable = ^ MHz 

(iv) Execution time = (1092!^) psec. 

{d) THE ARRAY ANALYSER 

r I 

The organization in which ■^•Tog2^i basic compu- 
tations are done in parallel is known as an array 
analyser. Due to high degree of parallelism, it 
yields a very high through-put rates. However, 
the cost of this approach limits its application. 
Its main features are : 

(i) ^.lDg2N a,u. for ■^.log2N operations in 
parallel , 

{ii) Max, sampling rate allowable = ^ MHz 

(iii) Execution time = t/Jsec. 

Reviewing the processors as mentioned above the 
se'quential processor has been selected. It is slowest 
but cheapest. This processor can be designed with micro- 
processor. A microprocessor has been used because of 
flexibility and cost. 
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SYSTEM DESIGN CONSTRAINTS [sj 

a) REAL TIME REQUIREMENT 
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Suppose we want to go upto VLF/ELF or speech' 
signals of SKHZt the minimum sampling rate will 
be 10 KHz {minimum sampling rate is double of the 
input signal frequency). If a 1024 point transform 
is chosen, a time of 0.1 sec will be available for 
input, processing and output for real time compu- 
tation . 

b) MEMORY REQUIREMENTS 

(i) Program memory 

(ii) SINE, COS values for computation of weight 

(iii) Values for windowing ® N 

(iv) Data memory = N or 2N depending on algorithm. 
Here N is number of sample points. 

c) COMPUTATION REQUIREMENTS 

1 . 

(i) ~ log 2 N complex multiplications or 2N log^N 
real multiplications 

(ii) N log 2 N complex additions ox subtraction or 
2N log 2 N real additions or subtraction. 

(iii) Bit reversal required for calculation of 
weights and for scrambling input/output 
depending upon the algorithm used. 

d) ACCURACY: 

Minimum of 1 6-bit processing accuracy is required. 
The detail is given in section 3.7. 
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3.5. CHOICE OF MICROPROCESSOR 

An 8x8 bit multiplication sub-routine on 8080 
processor takes about 200 micro-seconds. Since we are 
doing 16 bit processing, we will need to execute 8x6 bit 
sub^routine four times. So time per 16x16 bit multiplication 
will be 000 micro-seconds. For 1092 ^ complex multiplica- 
tions or 2N 1092^1 real multiplications, total time required 
will be 

4x200x2x1024x10 = 15384 msec. 

Thus it is necessary to use an external fast multiplier 
if we want to meet the real time requirement of 0.1 sec. 

In order to do one 16x16 bit multiplication on 8080 
using external multiplier, 8 MOV instructions and 8 load 
instructions are required. Thus it will take 4000 m sec. 
to do 2N 1092^ real multiplications. These are optimistic 
estimates. 

The basic reason why 8080 is slow, is that we require 
double precision (16 bit) accuracy and 6080 is only 8 bit 
processor. The same problem will be with other 8 bit 
processors like 8085 otc. Hence it is necessary to go for 
a 16 bit processor. TMS 9980 microprocessor which is a 1 6 
bit processor, has been chosen for the project. This 
processor has in-built multiplication routine (MPY). It 
does 16x16 bit unsigned multiplication. 
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The multiplier-interfacing to the system (TM 990/189) 
is not advantageous because the time taken for one multi- 
plication by interfaced multiplier is more than that of 
in-built multiplication routine. One MPY instruction takes 
62 cycles. The execution time for one MPY instruction can 
be computed ^53 as follows ; 


T 

where 

T 

tc{0) 

C 

W 

M 

Here 

t^(0) 

c 

M 

W 

Hence T 


t ( 0 ) (C+W.M) 
c 

total instruction time 
clock cycle time 

number of clock cycles for instruction execution 
plus address modification 

number of required wait states per memory access 
for instruction execution plus address modifi- 
cation. 

number of memory accesses 

0.400 micro-seconds 
62 
10 
0 

0.400 (62 + 0.10) 

24.8 micro-seconds 


If we interface the fast multiplier chip with the system 
(TM 990/189) through PSI, one LDCR and one STCR instructions 
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along with other instructions will be used for one multi- 
plication. We consider only LDCR and STCR instructions. 
The execution time for these instructions can be computed 
as follows : 

For LDCR; C«58,M*6, W=0 
= 0.400 (58 + 0-6) 

» 23.2 micro-seconds 
For STCR, C « 68, M « 8, W = 0 
Tg « 0.400 (68 + 0.6) 

* 27.2 micro-seconds. 

T =* Tj^ + Tg = 23.2 + 27.2 = 50.4 micro-seconds 

Hence it can be concluded that the interfaced multiplier 
takes double time than that of in-built multiplier. There 
fore we have used in-built multiplication routine. 
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3.6 DATA ACQUISITION AND DISPLAY SYSTEMS 

The schemes for both the data acquisition and 
the display systems are shown in Fig 3.2. These schemes 
have been implemented and tested. 

The PSI plays the key role for operating both the 
systems. It generates the control signals via software 
of the microprocessor system. The pins P8(LSB) through 
P15(MSB) of the PSI have been used as input output lines. 

The input analog signal after sampling is digitized 
by analog to digital converter (ADC). The start convert 
pulse to ADC is generated by the monostable circuit. The 
trigger input to monostable is generated by pin P6 of the 
PSI, The EOC pulse of ADC controls the switch of the 
sample and hold. The outputs of ADC are latched and are 
read and stored in the system memory by the microprocessor. 
During this operation the pin P5 is high and enables the 
latch Li and the buffer B2. So the display system remains 
off. 

The microprocessor performs the FFT computations 
of the stored input points. The output points are again 
stored in the same memory locations where input sample 
points were stored. To display these points the pin P5 
becomes low which enables the latch L2 and the buffer B3. 

So the data acquisition system remains off. Now the clocks 
at P6 goes to the counter Cl . The output points appear on 




Fibi 1’2 : SCHFn£ i-oi^ data AcaursiT/oN and dieplav systems 
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the I/O pins of the PSI one by one at each clock. The 
8-bit output of each FFT point is converted into analog 
form by digital to analog converter (DAC) and it goes to 
the Z-axis of the oscilloscope. At the same time the 
output of the counter Cl through DAC is displayed on the 
Y-axis of the oscilloscope. Since there is no clock to 
the counter C2, the output that goes to the X— axis of 
the oscilloscope is zero. Since the FFT computation is 
only for N=32, one clock appears at the counter C2 after 
the display of one set of 32 FFT points. This clock will 
shift the beam point right on the X-axis and the same 
operation is repeated and the second set of FFT points 
are displayed on the screen and so on. The intensity 
on the Y-axis displays the magnitude of the FFT points. 

The timing diagrams for the data acquisition and 
the display systems are shown in Fig 3.3(a) and 3.3(b), 
The chapter four and five describe respectively the 
hardware and software part of the data acquisition and 
the display systems. 

3.7 EIGHT BITS VERSUS SIXTEEW BITS ACCURACY 

With 8 bits in fractional part the precision of 
8 bits is given by 

Precision (6 bits) = 8 x log^gZ = 2.41 digits 

By increasing the number of bits in the fractional 
part a more accurate representation for the spectrum can 
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Fig 3.3(a) Timing diagram of the Data 
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be obtained. Also computation time considerations 
dictate that the maximum number of bits be limited. 

With 16 bits, sufficient accuracy of representation 
can be obtained without degrading the computation time 
to a large extent. 

With 16 bits, precision of 1 6 bits can be 
calculated as 

Precision (16 bits) = 16 x log^g2 = 4.82 digits 

This is quite sufficient for the envisaged use. There- 
fore 16 bits accuracy for the computation of FFT has been 
chosen for our project. 
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CHAPTER 4 

DESCRIPTION OF THE SYSTEM 
4.1 MICROPROCESSOR KIT 

The system used is Texas Instruments TM 990/189 [5^ . 

It is a Self-contained, single-board microcomputer system. 

The system features include: 

- TMS 9980A microprocessor. 

- 1024 bytes of RAM expandable on board to 2048 

- Bytes (each byte contains B bits of data) 

- 4096 bytes of ROM expandable on board to 6144 bytes 

- 2 MHz crystal controlled clock 

- Audio cassette interface 

- 16 bit programmable I/O port and interrupt monitor 
(TMS 9901) 

- 45 - key alphanumeric keyboard 

- Ten-digit, seven-segment L.E.D. type alphanumeric display 

- Visual and acoustic indicators 

- Resident system monitor and assembler 

- Single step instruction execution 

In addition to onboard memory expansion, two other system 
expansion options are available : 

- A TMS 9902 asynchronous communications controller, and 
accompanying interface circuits for either RS— 232-C or 
20 mA current loop terminals can be added. 
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- The bus can be expanded by use of the Bus Interface. 

Figure 4.1 shows the system architecture along with the 
user options. 

4.1 .1 . C.P.U. 

The TM5 9960A is a software-compatible member of 
TI’s 9900 family of microprocessors. The key features are; 

- 16 Bit Instruction word 

- Full Minicomputer Instruction Set Capability Including 
Multiply and Divide (multiplication and division) 

- Upto 16,384 Bytes of Memory 

- 8 Bit memory data bus 

- Advanced Memory to Memory Architecture 

- Separate Memory, I/O, and Interrupt. Bus Structures 

- 16 General Registers 

- 4 Prioritized Interrupts 

- Programmed and DMA I/O capability 

- On-chip 4 phase clock Generator 

- 40 pin package 

- N-Channel Silicon-Gate Technology 

A word is defined as 1 6 bits or 2 consecutive bytes 
in memory. The words are restricted to be on even address 
boundaries, i.e. the most significant half (8 bits) resides 
at even address and the least significant half at the 
subsequent odd address. A byte can reside at even or odd 


address . 
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The microprocessor signals are shown in Fig 4,2. The 
TMS 9980A CRU Interface Timing is shown in Fig 4,3. 

4.1.2 PROGRAMMABLE SYSTEM INTERFACE (PSI) 

It is a multifunctional component designed to provide 
low cost interrupt and I/O ports and an interval timer for 
TMS 9900 — family microprocessor systems. The key features 
are s 

- 9900 - family peripheral 

- Performs interrupt and I/O Interface functions: 

(i) Six dedicated interrupt lines 

(ii) Seven dedicated I/O lines 

(iii) Nine programmable lines as l/O or interrupt 

(iv) Up to 15 interrupt lines 

(v) Up to 22 input lines 

(vi) Up to 16 output lines 

- Easily cascaded for expansion 

- Interval or Event Timer 

- Single 5v power supply 

- All inputs and outputs TTL-Compatible 

The TMS 9901 Pin assignments and functions are shown 
in Fig 4,4. The TMS 9901 can be divided into three sub- 
systems: CRU interface, input/output interface and interrupt 
interface. 

a) CRU INTERFACE 

The CPU communicates with the TMS 9901 PSI via the CRU. 
The typical TMS 9901 application is shown in Fig 4.5, The 
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IMS 9901 occupies 32 bits in CRU read space and 32 bits in 
CRU write space. The Table 4,1 shows the mapping for CRU 
bit addresses to TMS 9901 function. The CRU interface 
consists of five address select lines (S0-S4) , chip enable 
(CE") and the three CRU lines (CRUIN, CRUOUT, CRUCLK), In 
the case of a write operation, the TMS 9901 strobes data off 
the CRUOUT line with CRUCLK. For a read operation, the data 
is sent to CPU on CRUIN line. The chip enable is generated 
by decoding the high-order address bits (A0-A9) on CRU cycles, 

CRU is a bit addressable (4096 bits), synchronous, 
serial interface over which a single instruction can transfer 
between one and 16 bits serially. Each one of the 4096 bits 
of the CRU space has a unique address and can be read and 
written. During multi-bit CRU transfers, the CRU address is 
incremented at the beginning of each CRU cycle to point to 
the next consecutive CRU bit. When a 9980 CPU executes a 
CRU instruction, the processor uses the contents of workspace 
register 1 2 as a base address. The CRU address is brought 
out on the 15-bit address bus; this means that the least 
significant bit of R12 is not brought out of the CPU. During 
CRU cycles, the memory control lines (MEMEN, WE and DBIN) are 
all inactive; MENEN being inactive (High) indicates the 
address is not a memory address and therefore is a CRU 
address or external instruction code. Also when MEMEN is 
inactive (High) and a valid address is present, address bits 
A0-A2 must all be zero to constitute a valid CRU address. 
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b) INPUT/OUTRJT INTERFACE 

Upto 16 individually controlled, I/O lines are 
available and the unused dedicated interrupt lines also 
can be used as input lines. After reset all I/O lines are 
programmed as inputs. By writing to any I/O line, that line 
will be programmed as an output line until another reset 
occurs* An output line can be read and indicates the present 
state of the pin, A pin programmed to the output mode can 
not be used as an input pin. Applying an input current to 
an output pin may cause damage to the TMS 9901 . The TMS 9901 
outputs are latched and buffered off chip, and inputs are 
buffered onto the chip, 

c) PROGRAMMABLE PORTS 

A total of nine pins ( INT7/P1 5-TWl 5/P7) on the TMS 
9901 are user-programmable as either I/O lines or interrupts. 
Any pin which is not being used for interrupt should have the 
appropriate interrupt mask disabled (mask=0) to avoid erro- 
neous interrupts to the CPU. To program one of the pins as 
an interrupt, its interrupt mask simply is enabled and the . 
line may be used as if it were one of the dedicated inter- 
rupt lines. 

d) SOFTWARE INTERFACE 

Figure 4,6 lists the TMS 9980 code needed to control 
the TMS 9901 PSI. The code initializes the PSI to an eight- 
bit input port, and an eight bit output port. 
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e) USER I/O PORT 

The detailed discription of the user I/O Port is 
shown in Fig 4,7. The user I/O Port extends from R12 CRU 
address 000<,g to 03E^g. The low order four bits {R12 CRU 
addresses 020 through 026) are also connected to the drivers 
for light emitting diodes CR1 through CR4 which illuminate 
in response to a logic one input. The individual bit assign- 
ments of the User Port are shown in Table 4,2. 

4.1.3 AUDIO CASSETTE INTERFACE (ACI) 

An audio cassette recorder can be used as a storage 
medium for programs used with the TM 990/189. Figure 4.8(a) 
shows the location of the pins for audio in, audio out and 
motor control. Figure 4.8(b) indicates the recorder and ACI 
connections. To dump memory to the cassette, the following 
instruction is used. 

< start address> <Sp> <stop address> <Sp> 

<8ntry address> <Sp> IDT= < name > <Sp> READY <Y>^ 

To load the memory from the cassette, L command is used. 

4.1.4 MEMORY ORGANIZATION . 

With fourteen address lines, the TMS 9980A can address 
upto 16,384 eight bit memory locations. In this system, a 
total of 8192 locations are dedicated to onboard devices 
and the remaining locations are reserved for off-board 

The off-board memory expansion capability is 


functions. 




User l/O Port 
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8 8.2 SVSTi-M I/O PORT 

The System I/O Port occupies CRU addresses 400*1 s through 43Ei6 «rnd consists of another 
TMS 9y01 (Uni dedicated to ont'ioard devices such as the keyboard, display, sound disc, and the 
audio oe>j»i>ette interface. Individual bit assignments for the system port ere shown in Table 8-6. 

8 8 2,1 Keybo^srd and Display Interface 

Signal flow h<»tween the IMS 9‘101 and keyboard and disfjlay is diagrammed ^ ^ iyureS-lS. UNI13UG 
sofiw.nr», routines scan both the keyboard and display thereby mimmuing the hardware required to 
di ive die 80 display segments and read the 45 keyswttches, 


TfiC* cfutplay is a twelve digit common cathode seven segment Lt.O type of whicn the middle ten 
di{pt^ uii* used. Used dlgrls are numbered left lo right from 1 to 10 Segment# mthm » d»gtt 
Ui'viynated as shown m Figure 849. 
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P2-1 Motor B END 
P2-2 Audio output END 
P2-3 Audio input END 
P2— 4 Motor A 
P2-S Audio output 
P2~6 Audio input 


Fig 4.a(a); ACI Connector Pins 


Recorder P2 Connections 

, _ . _ _ . j 

Auxiliary input P2-5 (Audio Output) and P2-2(CND) 

Earphone Output P2—6 (Audio Input) and P2-3(GND) 


Fig 4.8(b): Recorder And ACI Connections 
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provided through the Bus Expansion Interface, The system 

memory map, Fig 4.9; shows the organization of the memory 
Space. 

4,2 DATA ACQUISITION AND DISPLAY SYSTEM 

The schemes for data acquisition and display systems 
are discussed in the third chapter. The details of the 
schemes are explained in the following paragraphs. 

4.2.1 SAMPLE AND HOLD 

The sample and hold chip (SHM-IC-1) used is shown 

in Fig 4.1 0(a}. It is a self contained device requiring only 
an external holding capacitor, the value of which can be 
chosen according to the requirements of speed and accuracy. 
The required unity gain, noninverting sample and hold is 
shown in Fig 4, 10(a). 

We restrict our input signal magnitude to lie between 
+5\/ and --SV. The ADC which follows the S/H needs an analog 
input of —SV to +S\I or OV to 10V. The 100 PF capacitor 
gives an acquisition time of 2 micro seconds. To eliminate 
offset we use a 100 K ohm trimming potentiometer. 

4.2.2 ANALOG TO DIGITAL CONVERTER 

The analog to digital converter (ADC-EH8B2) £ 3 ] used 
is shown in Fig 4.10(c). The key featuresof this model are 
a 2 micro seconds conversion time»unipolar and bipolar 
operation, parallel and serial outputs and its low cost. 
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Fig 4.10(b): Pin configuration of rnonostable 
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Th ou'tput can be either in 2's complement or offset binary. 

5i iTice the input to ADC varies between +5V and -5V, Therefore 
configured the convertor to give bipolar operation. For 
si- polar operation calibration is necessary. We connected 

I 

iHi e +1 5V supply to pin 21 of the converter through a 100 ohm 

ill 

fe*c-imming potentiometer as shown in Fig 4.1Q(tl) . A precision 

i'll 

•/era It age source is connected to pin 32 and the input voltage 
ies-t to 0.020 volts. The potentiometer is adjusted such that 

e output code flickers equally between 10000000 and 1D000001 . 
'■•"le timing diagrams for the analog to digital converter are 
own in Fig 4,10(o). 

,< The start convert input signal to ADC is generated by 

I monostable circuit. The configuration of the 74121 is 
iTiown in Fig 4.1 D{b}. The output pulse is given by the 
wo rmula 

T s= Rext 

JWie start convert pulse should be at least of 100 nano 
aE 3 conds duration. We choose the pulse width to be 200 nano 
,E3econds and 

Cext = 47 PF 
Rext = 5.6 K ohm 

' • trigger pulse which generates the start convert pulse 

/ ' saL ven by PSI, generated via software of the system. 
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4.2,3 DIGITAL TO ANALOG CONVERTER 

The 8-bit digital to analog converter (DAC-08BC) fsj 
is shown in Fig 4.10(f), It provides very high speed of 
85 nano-seconds coupled with low cost and application 
flexibility. For unipolar operation, the arrangement is 
shown in Fig 4, 10(g). 


CF-.iFRAL library 

/Icc. No. 
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CHAPTER 5 
SOFTWARE DESIGN 

Ewer since the development of the first micro- 
processor in 1971t there has been a tremendous increase 
in the application of microprocessor in diverse areas 
such as process control, instrumentation, consumer products, 
data acquisition and many real time application. The micro- 
processors have their limitations in scientific and real 
time application. These limitations mainly arise because 
of the word length and the speed of available micropro- 
cessors . 

In this chapter, we will discuss the implementation 
of FFT on TMS 9980A microprocessor. 

5.1 SELECTION OF FIXED-POINT ARITHMETIC 

We selected fixed-point arithmetic over floating 
point arithmetic, because fixed point is faster than 
floating point through less accurate. In the FFT program 
utilizing fixed-point arithmetic, the input sequence is 
scaled such that it can be represented .. bits plus r- an 
and the binary point is assumed to lie to the left of the 
leftmost magnitude bit. As we move from stage to stage 
of the program, the magnitudes of the numbers in the 
sequence generally increase which may cause overflows 
during different stages of computations. To prevent over- 
flows, scaling is required in fixed-point arithmetic. 



5.2 SCALING [ 4 ] 
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The butterfly cycle of the power of two FFT 
algorithm operates on two complex numbers from the 
sequence. It takes these two numbers and produces two 
now complex numbers which replace the original ones in 
the sequence. Let X^(i) and ^^(j) the original numbers 
Then, the new pair is given by 

With the assumption that the binrary point lies 

at the extreme left, the relationship among the numbers 
iiH s ii# 

in m stage and m+1 stage is shown in Fig 5.1 . The 
outside square gives the region of the possible values, 
Rej^X^(i)J< 1 and ^^m^ ^ ^ circle inscribed in 

this square gives the region jX^(i)j< 1, The circle 

i 

inscribed in the inside square gives the region jx^^d) j< 

Now if X (i) and X (j) are inside the smaller circle, then 
in ni 

Xjjj^l(i) and will be inside the larger circle and 

hence not result in an overflow. Consequently, if we 

control the sequence at the m^*^ stage so that jX^(i) 1< 

s 'ii# 

we ere certain we will have no overflow at the m+1 stage 
However, if X (i) and X^{j) are inside the smaller square, 
then it is possible for X^^^(i) ox to be outside 

the larger square and hence result in overflow. 


* 



54 



Fig 5*1: 


Relationship between numbers in m**^ 

J. Lj 

and m+f stage. 
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Three techniques of scaling are given below : 

(i) SHIFTING RIGHT ONE BIT AT EVERY ITERATION: 

If the initial sequence X^{i), is scaled so 
that jX^{i) |< for all i and if there is a 
right shift of one bit after every iteration, 
then there will be no overflow. 

(ii.) CONTROLLING THE SEQUENCE SO THAT |x U)|<4 

If the initial sequence is scaled so that 
jx^(i)}< ^ for all i, then at each iteration 
we check jx^^Ci) j and if it is greater than 
one half for any i we shift right one bit 
before calculation throughout the next itera- 
tion . 

(iii) TESTING FOR AN OVERFLOW 

In this case the initial sequence is scaled 
so that ReCx (i)^<1 and I_[x (i)] < 1 whenever 
an overflow occurs in an iteration the entire 
sequence is shifted right by one bit and the 
iteration is continued at the point at which 
the overflow occured. 

The first technique is the simplest and easy to 
adapt for microprocessor. This method gives less accuracy 
than the other two techniques. In our system, the first 
technique has been selected. 
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5.3 ASSEMBLY LANGUAGE CODING OF FFT PROGRAM 

The in— place algorithm is slow and requires less 
memory locations compared to the natural input-output 
algorithm. Our system has only 2048 bytes of onboard RAM. 
Therefore the in-place algorithm for 32 sample points has 
been selected for our system. 

The FFT flow chart used for writing a program is 
shown in Fig 5.2. An assembly listing of the FFT program 
on the microprocessor TMS 99S0A is given in Appendix-A, 

The details of the program is carried out in the following 
sections . 

5.3.1 INITIALIZATION 


To initialize the FFT program, we do following 

No* of sample points N = 32 
Computation array X = 1 
Number of computation array r = 5 
Index of the data array K = 0 

5.3.2 WEIGHT GENERATION (W) 


The values of SINE and COSINE can be generated and 
stored in a array before entering the FFT. For this purpose 
only ^ +1 values of SINE from 0 to A/2 in a step size of 
2 A/N are needed and the other values of SINE and all the 
values of COSINE can be generated from these values. These 
N/4+1 values of SINE correspond to N/4+1 weights of W, i.e. 

wO, wN/O. 
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STAR1 



l<ij'«re S-a; I'M' computer program flow chart. 
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For example, for N = 32, 9 values of SINE are stored 

from 0 to ^/2 in a step size of 2 7( /32 shown in Fig 5.3. 

0 1 6 

These correspond to W , W , , . , Taking the left most 
bit as sign bit and others as a mantissa, the values of 
SINE will vary from 0000 0000 to 0111 1111. A set of 
nine values in hexadecimal is given in Fig 5.3. These 
values are stored in the system memory {M.A,: 0500-050C) 
before executing the program. 


Location 
in Memory 

Argument of 

SINE (Radians) 

Value of SINE 
(Hexadecimal) 

0 

0 

0000 

1 

{27r /32)x1 

18F8 

2 

(2;^ /32)x2 

30FB 

3 

(27r /32)x3 

471 C 

4 

{27r /32)x4 

5A82 

5 

(27: /32)x5 

6A6D 

6 

(27: /32)x6 

7641 

7 

(27r /32)x7 

7D8A 

8 

(27^/32)x8(= 7r/2) 

7FFF 

Fig 5.3: Values of SINE stored in 

Memory 

1 CALCULATION 

OF OTHER SINE VALUES 

(N=32) 


For calculating the values of SINE required in the 
FFT program, following procedure is adopted; 

(i) The program should check whether the index is 

^ H n-r 
^ 4 or >4 

(ii) If index ^ then use the index as it is to 

fetch the appropriate value of SINE from the array. 
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(iii) If index ^ 4 * the index is subtracted from 
"2 and this value is used as the index for 
fetching the appropriate value of SINE. 

(b) CALCULATION OF COSINE VALUES (N=32) 

The following procedure is followed 

(i) The program checks whether the value of index 



(ii) If index subtract the index from ^ and 

use this value as index to fetch the value from 
SINE array 

(iii) If index ^ subtract ~ from the index and 

USB this value as the index to fetch the value 
from SINE array. This will give appropriate 
COSINE values. 

The software is written to generate all the SINE 
and COSINE values. 

5,3.3 SUBROUTINES 

The subroutines which will be called by FFT program 
are given below : 

(a) BIT-REVERSAL ROUTINE (IBR) 

The flow chart for bit-reversal operation is shown 
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by box— 1 8 of the Fig 5.2 ♦ The assembly program for 
the routine is given in Appendix-A. 

(b) BUTTERFLY COMPUTATION ROUTINE 

The butterfly computation pattern for 8i>point FFT 
is illustrated in Fig 2,1. For an N-point FFT, the 
basic computation involves the evaluation of the 
following equations 

, p (5.1) 

“2 = °2 

where W = , P is an integer and D.j and 

are complex numbers fetched from sample locations 

I t 

in memory. D.j and D 2 axe the new values generated 
by the transformation to be stored back in the same 
locations where and were fetched from. The 

computation and D 2 from and D 2 and W in 
accordance with the equation (5.1) is the butterfly 
computation. This involves the multiplication of 
two complex numbers and the various steps in the 
computation are shown below. 

Let 

D2 = D2, D 2 j 

=* X + jY 

and W^,D 2 = A + jB 

then A =» X* ^2R ** 

B = Y.D2„ + X.D 2 J 
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1) 

Multiply 


and X, 

R1 = 

®2R* 

X 

2) 

Multiply 

°2I 

and Y, 

R2 = 

^21* 

Y 

3) 

Add A =* R1 


R2 




4) 

Compute D 

1 

1R 

°1R 

A 



5) 

Compute D 

f 

2R 

= - 

A 



6) 

Multiply 

°2R 

and Y, 

R3 = 

°2R* 

Y 

7) 

Multiply 

D__ 

21 

and X, 

R4 = 

^21* 

X 

8) 

Add B = R3 + 

R4 




9) 

Compute D 

I 

II 

= ”ii ^ 

B 



10) 

Compute D 

• 

21 


B 



11 ) 

Store 

and 

D 2 in place 

of D 

^ and D 2 


(c) OPTIMIZATION OF PROGRAM 

/ 

For real data inputs, the algorithms discussed in 
the section 2.8 is more time saving than the genera- 
lized algorithm chosen for our system in which the 
input datas are assumed complex. Since practically 
our input datas are also real, therefore we tried to 
optimize the program by exploiting the vary nature of 
the chosen algorithm. There are r(r=Log 2 N) iterations 
in the FFT program. In the first iteration, the 
weight of W is zero, which gives value of X=SIN as 
zero and Y=C0S as one. This means that the compo- 
nents in first array can be computed by simply addition 
and subtraction and the need for multiplication with 
SINE and COSINE is eliminated. 
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For example ; 

“ifi ' “iR “aR 

I 

j = 0 (Imaginary part is zero) (5.2) 

t 

D.«\ pv 5s Dji 

2R "1R ^2R 

I 

j s= 0 (Imaginary part is zero) 

The flow chart for computing butterfly cycles is 
shown in Fig 5»4. The figure indicates that for second 
iteration, the box~1 is repeated for half of the array 
as weight of W is zero. For second half of the array, 
the weight of W equals eight which gives value of X*SIN 
as one and Y=cCOS as zero. This means that the remaining 
components of the array are computed as per the equations 
given in box-2. 

Similarly for third, fourth and etc iterations, 
the box-3 is computed for any weight of W except the 
value of 0 and 8, The sequence of computations of box-S 
is given in section 5.3.3(b). 

The components of the array after computation 
through box-1, 2 or 3 are divided by 2 before storing 
back in the system memory. 

The square of the magnitude of the imaginary and 
real parts of each output value after all computation is 
computed using the following expression 

^ 
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Here we are not interested in the exact value of the 
output, so the squares of the outputs are computed. 

The analog form of these values are displayed on the 
screen through the Z-axis of the oscilloscope. 

5.3.4 SOFTWARE FOR THE DATA ACQUISITION AND THE 
DISPLAY SYSTEMS 

The software has been developed and tested on 
the microcomputer for the data acquisition and the 
display systems, A set of instructions that have been 
used for writing this program is shown in Fig 4.6. The 
used pins of PSI are selected with the help of Table 4.1 
and Table 4.2. The listing of the program is given in 
Appendix-A . 

5.4 ROUGH ESTIMATION OF EXECUTION TIME 

The rough execution time for one FFT for 32 points 
can be calculated by observing Fig 5.2 and the program 
listed in Appendix-A, We assume the average number of 
clock cycles for each instruction is 20 and the addressing 
modes are in work-space register modes . The time 
(t (Bf)) for one clock cycle is 0,400 micro-seconds. The 

W 

formula used to calculate the execution time (T) for one 
instruction is 

T = t^(0) (C+W-M) j 

C s= Clock cycles 

M = 0 (workspace register mode) 



The total time taken by butterfly routine 5 ? 
The total time taken by bit reversal routine = 
Time taken by Main program = 
Time taken by DAS = 
Time taken by Display system = 


50.00 tasec, 

145.00 msec. 
15.00 msec. 

7.00 msec. 

7.00 msec. 


Total time (approx.) 


324.00 msec. 
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CHAPTER 6 

CONCLUSION AND FUTURE WORK 
6.1 CONCLUSION 

The TMS 99BOA microprocessor assembly language 
program for computing Fast Fourier Transform for a set of 
32 points has been developed and tested on TM 990/189 
microcomputer . 

The data acquisition and the display systems were 
implemented and test-ed, A number of clocks for both the 
systems has been generated via the software of the micro- 
processor. The frequency of the clock (trigger to the 
monostable) generated . via the software for the data 
acquisition system is 7.8 KHz. So the sampling frequency 
is also 7.8 KHz. According to the Nyquist criterion, the 
sampling frequency should be at least the double of the 
input signal frequency. Therefore the maximum input signal 
frequency can be 3.9 KHz. 

The eight bits analog to digital converter and eight 
bits digital to analog converters easily available in the 
store were used in our project. The eight bits input points 
are stored in the MSB byte of a word in the aemory and after 
the FFT computation, only the MSB byte of a word is taken 
out from the system memory and displayed on the oscilloscope. 
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Hence the accuracy ctf FFT points is reduced due to round 
off error. The accuracy can be increased by using 16 bits 
ADCs and DACs. 

The execution time to compute FFT for a set of 32 
points is 324 m sec. 

Our system is not made for real time computation. 

The system samples a set of 32 points in a period of 324 
m sec, and the information contained in this period is lost# 
A real time system can be designed if we choose a high speed 
processor. 

6.2 FUTURE WORK 

The following related work to spectrum analysis is 
suggested to improve the performance: 

1 . The spectrum analyser for real time can be 
developed. 

2. The off-board memory area on TM 990/189 can be 
used. For this purpose 74L5245 buffers are needed. 
At present they were not available in the store. 

3. The 0086 or any other 16-bits microprocessor can 
be tried out in place of TMS 9980A, 

4. Parallel processor organization instead of 
sequential scheme may be tried to exploit 
parallelism in the algorithm. 

5. The algorithms discussed in sections 2.4 and 2.6 
can be tried out. 

6. The spectrum analyser can also be designed with 

CS m 4 r*. r-i rn 
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APPENDIX-A 

ASSEMBLY LISTING 

A1 

Memory Machine 

Instruction 

Comments 

Address Code 



1 . Nine values of SI 

NE from 0 to ' /2 in 

step of 2 /32 

stored 




Q7FE 

ieF8 



07F2 

30FB 



07F4 

471C 



D7F6 

5A82 



07F8 

6A6D 



07FA 

7641 



D7FC 

7D8A 



07FE 

7FFF 



2. Initial 

values 



612 

0005 


value of r 

614 

0020 


value of N 

3. Program 

for data 

acquisition system 


0510 

Q4CC 

CLR R12 


0512 

0203 

LI R3, > 630 


0514 

0630 



051 6 

04 C4 

CLR R4 


0S1B 

1D00 

5B0 0 

clock mode 

051A 

1E0F 

SBZ 15 

all I/O becomes 




in input mode 

051C 

1E00 

SBZ 0 


051 E 

020C 

LI R12, 0002 


0520 

0002 



0B22 

04E4 

CLR R4 


0524 

3244 

LDCR R4, R9 


0526 

1000 

NOP 


0528 

1D1 4 

SBO 20 

P5=1 for ADC enable 

052A 

020C 

LI R12, > 030 


052C 

0030 



052E 

1EFE 

SBZ -2 


0530 

020C 

yy LI R12, > 030 


0532 

0030 



0534 

1DFE 

SBO -2 

clock et P6 

0536 

1EFE 

SBZ -2 


0538 

1000 

NOP 


053A 

04D3 

CLR *R3 

§ 

053C 

3613 

STCR *R3, 8 


053E 

05C3 

INCT R3 

R3=R3+2 



0540 

0542 

0544 

0546 

0548 

054A 

054C 

054E 

0550 

0552 

0554 

0556 

0558 

055A 

055C 

055E 

0560 

0562 

0564 

0566 

0568 

056A 

056C 

056E 

0570 

0572 

0574 

0576 

047E 

0420 

4. Main 

0368 
036A 
036C 
036E 
0370 
0372 
0374 
0376 
0378 
037A 
037C 
037E 
0380 
0382 
0384 
0386 
0388 
038 A 


A 2 


F'^T 


0584 

INC R4 


0284 

Cl R4, 32 


001 F 



12F4 

JLE : 530 


1000 

NOP 


1000 

NOP 


0204 

LI R-1, ' 6F0 


06F0 



0201 

LI R1 , ; 630 


0630 



0200 

Li RO, -'SODO 


8000 



C0F1 

AA2 MOV *R1+, R3 

- 

2008 

COO R8, R3 


1 303 

JEQ /lAO 


CD03 

MOV R3, *R4+ 


D4F4 

OLR ->‘R4 + 


1003 

JMP AA1 


0503 

AAO NEE R3 


CD03 

MOV R3, *R4+ 


0734 

SETO *R4+ 


04F4 

AA1 OLR *R4+ 


04F4 

OLR *R4+ 


0284 

01 R4, ^7F0 


07F0 



1 1 F2 

JLT AA2 


1000 

MOP 


1083 

JMP >47E 


10D0 

JMP >420 


10A3 

JMP -368 


program 



02E0 

LWPI . 670 


0670 



0201 

LI R1 , 1 

/ =1 

0001 



0209 

Ll R9, >612 


0612 



C039 

MOV *R9+, RO 

r=5 

0403 

OLR R3 


01 43 

MOV R3, R5 

KQ 

01 39 

MOV *R9+, R4 


01 84 

MOV R4, R6 


0606 

DEC R6 

R6 = N-1 

091 4 

SRL R4, I 

N2 = N/2 ^ 

CE44 

MOV R4, *R9+ 

N2, MA=61 

0600 

DEC RO 

r=1=NU1 

8801 

LI C R1 , @ , 612 

r 

0612 



151E 

JET Lv 



II 



A3 


038C 

0202 

L2 LI R2, 1 

R2=I=1 

038E 

0001 



0390 

C0C5 

L3 MOV R5, R3 

R3=K 

0392 

0260 



0394 

0000 

Cl RO, 0 


0396 

1301 

JEQ A1 

K 

0398 

0903 

SRL R3, 0 

jVj 2: 

Mill 

039A 

C803 

A1 MOV R3, 0 > 6B2 

2 IV w 1 

039C 

06B2 



039E 

0420 

BLWP li >340 


03A0 

0340 



Q3A2 

0420 

BLWP @ >1 50 


03A4 

01 50 


1 w 

03A6 

0585 

INC R5 

K = K+1 

03A8 

C285 

MOV R5, RID 

K-^ R1 0 

03AA 

81 02 

C R2, R4 

I=N2 

03AC 

1302 

JEQ L20 


03AE 

0582 

INC R2 

1 = 1+1 

03B0 

10EF 

JMP L3 


03B2 

A284 

L20 A R4, RIO 

K=K+N2 

03B4 

C1 4A 

MOV RIO, R5 


03B6 

01 8A 

C RIO, R6 

K < N-1 

03B8 

11E9 

JLT L2 


03BA 

04C5 

CLR R5 

K=0 

03BC 

0600 

DEC RO 

NU1 -1 

03BE 

0914 

SRL R4, 1 


03C0 

C804 

MOV R4, @ > 61 6 

N2=N2/2 

03C2 

061 6 



03C4 

0581 

INC R1 

tl 

03C6 

10DF 

JMP LI 


03C3 

C805 

LY MOV R5, © >6B2 


03CA 

06B2 


t 

03CC 

0420 

BLWP @ >340 

^=IBR(K) 

03CE 

0340 



03D0 

C1E0 

MOV @ >61 C, R7 

i "i R7 

03D2 

061 C 



03D4 

8147 

C R5, R7 

i < K 

03D6 

121A 

JLE L2 


03D8 

C285 

MOV R5, RIO 

R5=R1 0=K 

03DA 

0A3A 

SLA RIO, 3 

8K 

03DC 

0209 

LI R9, >6F0 

Addr. of 


03DE 

06F0 

03EQ 

A289 

03E2 

C03A 

03E4 

C07A 

03E6 

COFR 

03E8 

C11A 

03EA 

C207 

03EC 

OA30 

03EE 

A209 

03F0 

064A 

03F2 

064A 


A R9, RIO 
MOV *R10+, RO ) 
MOV *R10 + , R1 ) 
MOV *R10+, R3 ) 
MOV *R10, R4 ) 
MOV R7, R8 
SLA R8, 3 
A R9, R8 
DECT R10 
DECT RIO 


{6FO+8K) — R1 0 
T3 X(K) 
i R 8 

{8i+6F0) RB 



A4 


03F4 

D64A 

DEOT RIO 

03F6 

0EB8 

MOV ♦Ra+, *R10+ 

03FB 

0EB8 

MOV *R8+, *R10+ 

03FA 

O^^BB 

i^OV *R8 + , *R10+ 

03FC 

0698 

MOV *R8, *R10 

03FE 

0648 

DECT R8 

0400 

0648 

DECT R8 

0402 

0648 

DECT R8 

0404 

OEOO 

MOV RO, *R8+ 

0406 

0E01 

MOV R1 , ♦RB+ 

0408 

CE03 

MOV R3, *R8+ 

040A 

0604 

MOV R4, '»R8 

040C 

81 85 

LZ Cv R5, R6 

040E 

1 302 

JEQ LP 

0410 

0585 

INC R5 

0412 

10DA 

IMPLY 

0414 

1000 

LP NOP 

5. Program for 

display system. 

0416 

0201 

LI R1 , > 6F0 

0418 

06F0 


041 A 

0205 

LI R5, >630 

041 C 

0630 


041E 

1001 

JMP S+4 

0420 

10A3 

JMP 368 

0422 

00B1 

AO MOV *R1+, R2 

0424 

1000 

NOP 

0426 

0002 

MOV R2, RO 

0428 

3680 

MPY RO, R2 

042A 

0501 

INCT R1 

042C 

OOF1 

MOV *R1+, R3 

042E 

1000 

NOP 

0430 

0003 

MOV R3, RO 

0432 

1002 

JMP >438 

0434 

1031 

JMP >498 

0436 

1080 

JMP >338 

0438 

3800 

MPY RO, R3 

043A 

A083 

A R3, R2 

0430 

0A22 

SLA R2, 2 

043E 

CD42 

MOV R2, *R5+ 

0440 

0501 

INCT R1 

0442 

0281 

01 R1, >7F0 

0444 

07F0 


0446 

11 EE 

JLT AO 

0448 

1000 

NOP 

044A 

1000 

CO NOP 

0440 

0203 

LI R3, >630 

044E 

0630 


0450 

0404 

CLR R4 

0452 

0200 

LI P12, >02A 


X(i) T5 
Unscrainbled values 


X 


2 



+Y 


2 



A5 


0454 

002A 

0456 

1E00 

0458 

1000 

045A 

020C 

045C 

0030 

045E 

1EFE 

0460 

020C 

0462 

0030 

0464 

1DFE 

0466 

1EFE 

0468 

1000 

046A 

321 3 

046C 

05C3 

046E 

0584 

0470 

0284 

0472 

0020 

0474 

12F5 

0476 

1000 

0478 

1000 

047A 

104A 


6. Sub-routine for 


01 50 

0690 

0152 

01 54 

01 54 

C060 

01 56 

067A 

01 58 

C0C1 

01 5A 

0A31 

01 5C 

1000 

01 5E 

0202 

01 60 

06F0 

01 62 

A042 

01 64 

AOEO 

01 66 

061 6 

01 68 

0A33 

01 6A 

A0C2 

01 6C 

0202 

01 6E 

07FD 

01 70 

C2A0 

01 72 

0672 

0174 

028A 

01 76 

0001 

0176 

162A 

01 7A 

Cl 31 

01 7C 

Cl 91 

01 7E 

0208 

01 80 

8000 

0182 

21 88 

0184 

1601 


SBZ 

NOP 

LI R12, >030 


SBZ -2 

YY LI R12, >030 

SBQ -2 
SBZ -2 
NOP 

LDCR *R3, a 
INCT R3 
INC R4 
Cl R4, 32 

JLE >460 

NOP 

NOP 

JMP >510 

Butterfly cycles. 

LWPI > 690 

MOV @ > 6TA, R1 

MOV R1 , R3 
SLA R1 , 3 
JMP 1+2 
LI R2, >6F0 

A R2, R1 
A ii >616, R3 

SLA R3, 3 
A R2, R3 

LI R2, > 7F0 

MOV i >672, RIO 

Cl R10, 1 

JNE aid 

A1 MOV *R1+, R4 
MOV ♦Rl , R6 
LI R8, > 8000 

COC R0, R6 
JNE B1 


P5=0 enable display 
system 


clock at P6 


R1=K 


R1=6F0+8K addr. of D1 
K+N2 

6F0+8(K+N2)= addr. 

p of D2 

addr. of 
R10= I 

-e=i 

Sign of 


If sign is +ve 



A6 


0186 

0504 


NEG R4 

01 68 

Cl 73 

B1 

MOV *83+, R5 

01 8A 

C093 


MOV *R3, R2 

01 8C 

2088 


COC RB, R2 

01 8E 

1601 


JNE B2 

0190 

0505 


NEG R5 

0192 

C1C4 

B2 

MOV R4. R7 

0194 

020A 


LI RIO, > 6B2 

01 96 

06B2 



01 98 

CE87 


MOV R7, *R10+ 

01 9A 

CE85 


MOV R5, *R10+ 

01 9C 

CE85 


MOV R5, *R10+ 

01 9E 

C684 


MOV R4, *R10 

01 AO 

0420 


BLWP @ >4C6 

01 A2 

04C6 



01 A4 

C460 


MOV @ >6BC,*R1 

01 A6 

06BC 



01 AB 

C4E0 


MOV @ >6BA,*R3 

01 AA 

06BA 



01 AC 

Cl 20 


MOV @ >6Ba, R4 

01 AE 

06Ba 



01 BO 

Cl EO 


MOV @ >6B2,R7 

01 B2 

06B2 



01 B4 

091 4 


5RL R4, 1 

01 B6 

0917 


SRL R7, 1 

01 B8 

0641 


DECT R1 

01 BA 

0643 


DECT R3 

01 BC 

C444 


MOV R4, *R1 

01 BE 

C4C7 


MOV R7, *R3 

01 CO 

0380 


RTWP 


D2R 

ii§n of D 2 r 


Dl R 


To addition routine 


01C2 028A A10 Cl R10, 2 

01 C4 0002 

01 C6 161 A JNE A15 


01C8 

C2A0 

MOV @ >61C, R10 


01 CA 

061 C 



01 CC 

028A 

Cl RIO, 0 


01CE 

0000 



01D0 

13D4 

JEQ A1 

t 

01D2 

Cl 31 

MOV *81+, R4 

°1R 

01D4 

0914 

SRL R4, 1 

01 D6 

0SC1 


1 

01D8 

Cl 73 

MOV *R3+, R5 

®2R 

01DA 

0915 

SRL R5, 1 

01DC 

CC45 

MOV R5, *R1+ 


01DE 

C453 

MOV *R3, *R1 

01 EO 

0551 

INV *R1 


01 E2 

C2B3 

MOV *R3+, RIO 


01 E4 

CCC5 

MOV R5, *R3+ 

°2i 

01 E6 

C4CA 

MOV RIO, *R3 

01 E8 

0641 

DECT R1 


01 EA 

0641 

DECT R1 




□ 1EC 

0643 


DEOT R3 


OIEE 

0643 


DEOT R3 


01 FO 

C4D1 


MOV *R1 , *R3 

Sign of D"^^ 

01F2 

0641 


DEOT R1 

01 F4 

0643 


DEOT R3 


01F6 

0404 


MOV R4, *R3 

D" 

2R j,., 

^1R 

01 F8 

0444 


MOV R4, *R1 

01 FA 

0380 


RTWP 

01 FC 

02A0 

A1 5 

MOV @ >610, RIO 


01 FE 

0610 




0200 

028A 


01 RIO, 0 


0202 

0000 




0204 

1 3BA 


JEQ A1 

P=0 

0206 

028A 


01 RIO, 8 


0208 

0008 




020A 

13E3 


JEQ A2 

P=8 

D20C 

1 567 


JGT A20 

P > 8 

020E 

0209 


LI R9, 8 


0210 

0008 




0212 

624A 


S RIO, R9 

(N/4-Index) 

Addr. for COS valu 

0214 

0609 


DEC R9 


0216 

0A19 


SLA R9, 1 


0218 

A242 


A R2, R9 

Addr. of X=COS 

021A 

0119 


MOV *R9, R4 


0210 

3913 


MPY *R3, R4 

®2R-X 

021 E 

0A14 


SLA R4, 1 

0220 

060A 


DEC RIO 


0222 

0A1 A 


SLA RIO, 1 


0224 

A282 


A R2, RIO 

Addr. of Y=SiN 

0226 

C15A 


MOV *R10, R5 


0220 

3973 


MPY *R3+, R5 

^2R-Y 

022A 

0A1 5 


SLA R5, 1 

0220 

0503 


INCT R3 


022E 

01 9A 


MOV *R10, R6 


0230 

3993 


MPY *83, R6 

D21.Y 

0232 

0A1 6 


SLA R6, 1 

0234 

01D9 


MOV *R9, R7 


0236 

39F3 


MPY *R3+, R7 

D21.X 

0238 

0A17 


SLA R7, 1 

023A 

0208 


LI R8, > 8000 


0230 

8000 



Sign of D 2 J 

023E 

0253 


MOV *R3, R9 

0240 

2248 


COO R8, R9 

0242 

1602 


JNE A30 


0244 

0506 


NEG R6 


0246 

0507 


NEG R7 


0248 

0643 

A30 

DECT R3 


024A 

0643 


DECT R3 


0240 

0253 


MOV 4^R3, R9 

Sign of D 2 pj 

024E 

2248 


COO R8, R9 



0250 

0252 

0254 

0256 

0336 

0434 

0498 
049A 
0490 
049E 
04A0 
04A2 
04A4 
04A6 
04A8 
04AA 
04AC 
04AE 
04B0 
04B2 
04B4 
04B6 
04Ba 
04BA 
Q4BC 
04BE 
04 00 
0436 
0338 
025E 
0260 
0262 
0264 
0266 
0268 
026A 
0260 
026E 
0270 
0272 
0274 
0276 
0278 
027A 
0270 
027E 
0280 
0282 


A8 


1602 


JWE 

A31 


0504 


NEG 

R4 


0505 


NEG 

R5 


106F 


JMP. 

>336 


107E 


JMP 

> 434 


1031 


JMP 

>498 


2108 


coo 

R8, R7 


1602 


JI\iE 

B1 


2148 


coo 

RB, R5 


1602 


JNE 

B2 


6105 

B1 

5 R5, R7 


1003 


JMP 

B3 


0507 

B2 

NEG 

R7 


A1 05 


A R5, R7 


0507 


NEG 

R7 


2108 

B3 

000 

R8, R4 


1602 


JNE 

B4 


21 88 


000 

R8, R6 


1302 


JEO 

B6 


A106 

B4 

A R6, R4 


1003 


JMP 

B8 


0504 

B6 

NEG 

«4 


6106 


S R6, R4 


0504 


NEG 

R4 


0284 

B8 

MOV 

R4, RIO 


0147 


MOV 

R7, R5 


10BA 


JMP 

>436 


1000 


JMP 

> 338 


1092 


JMP 

> 25E 


C1B1 

A90 

MOV 

*R-r+, R6 


0251 


MOV 

*R1, R9 


2248 


000 

R8, R9 


1601 


JNE 

A50 


0506 


NEG 

R6 


0207 


LI RL >6B2 


06B2 





0D06 


MOV 

R6, *R7+ 


ODOA 


MOV 

RID, *R7+ 

CD06 


MOV 

R6, *R7+ 


0504 


MOV 

R4, *R7 


0420 


BLWF 

’ @ >400 


0406 





01AQ 


MOV 

® >6B2, 

R6 

06B2 





0120 


MOV 

@ >6B8, 

R4 

06B8 





0460 


MOV 

0 >6B0, 

*R1 

06B0 






°1R 

Sign tif D-|r 


Addition routine 


Sign of 



0284 

0286 

0288 

028A 

0280 

028E 

0290 

0292 

0294 

0296 

0290 

029A 

029C 

029E 

02A0 

02A2 

02A4 

02A6 

02A8 

02AA 

02AC 

Q2AE 

02B0 

02B2 

02B4 

02B6 

02B8 

02BA 

02BC 

02BE 

02C0 

02C2 

02C4 

02C6 

02C8 

02CA 

02CC 

02CE 

02D0 

02D2 

02D4 

02D6 

02D8 

02DA 

02DC 

02DE 

02E0 


A9 


0641 

0914 
CC44 
C4E0 
06BA 
0643 
0916 
CCC6 
1002 
1000 
1000 
05C1 
C1B1 
0251 
2248 
1601 
0506 
0285 
0207 
06B2 
0DC6 
CBOA 
0D06 
0505 
0420 
04 06 
01A0 
06B2 
0160 
0638 
0460 
06B0 
0641 

0915 
0445 
0503 

0916 
0006 
04E0 
06BA 
1000 
0380 
1000 
1000 


DECT R1 
SRL R4, 1 

MOV R4, *R1+ D» 

MOV i >6BA, *R3 sign of 

DECT R3 
SRL R6, 1 

MOV R6, *R3+ D> 

JMP >29A 

NOP 

NOP 

INCT R1 
MOV *R1+, R6 
MOV *R1 , R9 
COO R8, R9 
jNE A60 
NEC R6 

A60 MOV R5, R10 
LI R7, >6B2 


MOV R6, *R7+ 
MOV RIO, *R7+ 
MOV R6, *R7+ 
MOV R5, *R7 
BLWP S >406 


MOV @ >6B2, R6 

MOV @ >6B8, R5 


MOV @ > 6BC, *R1 

DECT R1 
SRL R5, 1 
MOV R5, *R1 
INCT R3 
SRL R6, 1 
MOV R6, *R3+ 

MOV @ >6BA, *R3 

NOP 

RTWP 

NOP 

NOP 


Sign of D|pj 




®2I 
Sign of D’ 


21 


0209 A20 LI R9, 16 

0010 

624A S RIO, R9 (N/2-;n4px) Adcjr 

for SIW values ( 





A10 


02E2 

0609 

DEO R9 



02E4 

0A19 

SLA R9, 1 



02E6 

A242 

A R2, R9 

Addr. of 

SIN 

02E8 

0200 

LI RO, 8 



02EA 

0008 




02EC 

6280 

S RO, RIO 

( Index-IM/ 4 ) 

02EE 

060A 

DEO RIO 



Q2F0 

OA'IA 

SLA RIO, 1 



Q2F2 

A2B2 

A R2, RIO 

Addr. of 

COS 

02F4 

1000 

NOP 



02F6 

01 1A 

MOV *R10, R4 



02FB 

3913 

MPY *-R3, R4 



02FA 

QA14 

SLA R4, 1 


02FC 

0159 

MOV *R9, R5 



Q2FE 

3973 

MPY *R3+, R5 



0300 

0A15 

SLA R5, 1 


0302 

0199 

MOV *R9, R6 



0304 

0503 

INOT R3 



0306 

3993 

MPY *R3, R6 

Dji-Y 


0308 

0A16 

SLA R6, 1 


030A 

01DA 

MOV *R10, R7 



030C 

39F3 

MPY *R3, R7 

Djj.X 


030E 

0A17 

SLA R7, 1 


0310 

0208 

LI R8, >8000 



0312 

8000 




0314 

0553 

INV *R3 



0316 

0253 

MOV *R3, R9 



031 8 

2248 

COO P8, R9 



031A 

1602 

JNE A 75 



031C 

0507 

NEG R7 



031E 

1001 

JMP A 76 



0320 

0506 

A75 NEG R6 



0322 

0643 

A76 DECT R3 



0324 

0643 

DECT R3 



0326 

0553 

INV *R3 



0328 

0253 

MOV *R3, R9 



032A 

2248 

COO R8, R9 



D32C 

1 602 

JNE A 77 



032E 

0504 

NEG R4 



0330 

1001 

JMP A78 



0332 

0505 

A77 NEG R5 



0334 

1090 

JMP A31 



7. Sub 

-routine 

for Addition and Substract ion . 


04C6 

06B0 

LWPI >6B0 



0408 

040A 

LI R8, >8000 



04 CA 

0208 



0400 

8000 




040E 

2048 

COO R8, R1 





04DO 

1604 

Q4D2 

2088 

Q4D4 

1609 

04D6 

6042 

04D8 

1001 

04DA 

6042 

04DC 

2048 

Q4DE 

1602 

04E0 

0501 

04E2 

0705 

Q4E4 

04C5 

04E6 

1003 

04Ea 

0501 

04 EA 

AQ42 

04EC 

0705 

04EE 

2108 

04F0 

1 606 

D4F2 

2QC8 

G4F4 

1606 

04F6 

0504 

04FB 

6103 

04FA 

0706 

Q4FC 

1008 

04FE 

A103 

0500 

1001 

0502 

A103 

0504 

2108 

0506 

1602 

0508 

0504 

050A 

0706 

05QC 

04 C6 

05QE 

0300 


8. Sub 

-routine 

0340 

06B0 

0342 

0344 

0344 

0206 

0346 

0001 

0348 

04C7 

034 A 

C081 

D34C 

0912 

034E 

C242 

0350 

0A19 

0352 

0A17 

0354 

6042 

0356 

A1C1 

0358 

ca42 

035A 

0586 

035C 

8806 

Q35E 

0612 

0360 

12F4 

0362 

CB07 

0364 

Q61C 

0366 

0380 


JNE B1 
CQC R8, R2 
JNE B2 
5 R2, Rl 
JMP A1 , 

B1 5 R2, R1 
A1 CQC RB, HI 
JNE A 2 
NEC R1 
SETO R5 
A2 CLR R5 
JMP B3 
B2 NEC R1 
A R2, HI 
SETG R5 
B3 CQC H8, R4 
JNE B4 
COC RB, R3 
JNE B5 
NEG R4 
5 R3. R4 
SETO R6 
JMP B6 
B4 A R3, R4 
JMP A3 
B5 A R3, R4 
A3 COC R8, R4 
JNE A4 
NEG R4 
SETO R6 
CLR R6 
RTWP 


for Bit reverse (IBR)» 


LWPI 

> 6BQ 


LI R6, 1 


11=1 

CLR 

R7 


IBR=0 

L9 MOV 

R1 , 

R2 

R1=M=J1 

SRL 

R2, 

1 


MOV 

R2, 

R9 

2.J2 

SLA 

R9, 

1 

SLA 

R7, 

1 

2 • IB R 

S R9, R1 



A R1 

I, R7 


MOV 

R2, 

R1 


INC 

R6 


11 < r 

C @>612, 

, R6 

JLE 

L9 


P=61C 

MOV 

R7, 

@ > 61 C 

RTWP 





